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Abstract—Descriptive and statistical age-period-cohort (APC) analysis methods have received 
considerable attention in the literature, The statistical modeling of APC data often involves the 
popular multiple classification model, a model containing the effects of age groups (rows), periods 
of observation (columns), and birth cohorts (diagonals of the age-by-period table). The 
identifiability problem inherent to this model is discussed, and its adverse effects on the results of 
APC modeling exercises are illustrated numerically. Potential problems attendant with the use of 
two-factor models ate described, and other possible modeling approaches currently in use are 
discussed. Interpretational limitations due to certain innate characteristics of typical APC data sets 
are also detailed. Given all the documented potential sources for error, the current state-of-the-art 
regarding the staiistical modeling of APC data should be considered to be at an early stage of 
development. 


1. INTRODUCTION 

AGE-PERIOD-COHORT (APC) analysis has been a popular epidemiologic tool since Frost [1] 
employed it in his classical study of tuberculosis. The procedure he developed is primarily 
descriptive, with graphs used to examine patterns in disease rates over time. 

In the past few years, however, many studies appearing in the epidemiologic literature 
have utilized a more analytical approach to the treatment of APC data, such an approach 
involving the statistical fitting of regression models designed to quantify the separate effects 
of the three factors age, period, and cohort. These investigations have encompassed several 
different diseases such as breast cancer [2,3], colon cancer [4], cancer of the cervix [5], 
prostate cancer [6, 7] bladder cancer [8, 9], and lung cancer [9-13]. Each study has adopted 
a regression analysis approach to the treatment of incidence or mortality data; typically, 
a three-factor model (age at occurrence of disease or death, time of occurrence of disease 
or death, and birth cohort), a two-factor model (usually, age at occurrence and birth 
cohort), or some modification of these two models has been employed. Many of these types 
of studies have resulted in epidemiologic statements being made about the etiology of the 
diseases under investigation. 

To an epidemiologist who is not thoroughly acquainted with the specifics of various 
statistical modeling approaches currently used to analyze APC data, the results and 
ultimate utility of such quantitative analyses can be quite hard to evaluate. In particular, 
some reasonable questions that a health researcher might ask about such modeling 
exercises are the following: How much credence can be given to the results of such 
statistical analyses? What are the possible sources of error associated with the use of 
various regression procedures for modeling APC data? Do such sophisticated modeling 
procedures truly provide interpretational advantages over traditional graphical 
approaches? 
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Our purpose in writing this paper is to review and critique the general area of 
age-period-cohort analysis and to discuss and illustrate some of the important limitations 
of popular statistical modeling approaches for analyzing APC data. We will argue that any 
interpretations regarding patterns in age, period, and cohort effects based on the use of 
such modeling procedures must be made with a great deal of caution. Furthermore, we 
will stress that any statistical modeling of APC data should be carried out in conjunction 
with a detailed descriptive analysis such as discussed by Kleinbaum et al. [14] and Glenn 
[15]. In this paper, we will deal with statistical APC methodology only as it applies to 
epidemiologic data, although APC analysis has been an integral part of many disciplines 
such as sociology, demography, developmenial psychology, political science, and 
economics [16-26]. 

2 . DESCRIPTIVE AGE-PERIOD-COHORT ANALYSIS 

A descriptive APC analysis of epidemiologic data first assembles disease mortality or 
morbidity rates in a two-way table with, say, the rows representing categories of age at 
occurrence and the columns defining categories of year of occurrence. In general notation, 
for the /th of a age groups and the yth of p periods, we will let = OiJN^j denote the 
observed rate in the (/.yjth cel! of such a table, rvhere O/j is the observed number of deaths 
or illnesses and N^j is the number of person-years at risk. 

An example of such a data layout is presented in Table 1 , which gives lung cancer 
mortality rates for United States white males. These rates are based on 5-year age intervals 
and S-year period intervals. (Although age and period intervals of the same width are not 
required, we will make this generally unrestrictive assumption to avoid certain esoteric 
mathematical complexities [7, 17].) 

The diagonals of this Table (going from upper left to lower right) define the lung cancer 
mortality rate patterns for successive groups of U.S. white males who were born together 
and hence age together. In Table 1, these so-called bjrih cohorts extend over 9-year 
intervals, and each such birth cohort is typically identified by its central birth year. For 
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Table I. Luno cancer MORTAum’ rates* per 100,000 and average numbers of luno cancer deaths ?er year 



FOR 

U.S. WHITE 

M.ALES BY 

YEAR (1931- 

■1975) AND 

BY AGE 130-84). BOTH IN 

FIVE-YEAR 

intervals 


Age 

group 





Calender period 




1931-35 

1936-40 

1941-43 

1945-50 

1951-55 

1956-60 

1961-65 

1966-70 

1971-75 

30-34 

Rate 

1.08 

1.43 

1.64 

1.58 

1.52 

1.96 

2,16 

2.14 

1.74 


Deaths 

45 

64 

78 

79 

SO 

104 

105 

103 

97 

35-39 

Rats 

2.38 

3.06 

3.66 

4,10 

4.32 

5.26 

6,35 

7.25 

7.21 


Deaths 

98 

127 

163 

197 

219 

282 

334 

357 

350 

40-44 

Rate 

4.51 

6.78 

8.46 

10,05 

11.75 

13.40 

16.07 

19.60 

19.87 


Deaths 

180 

271 

352 

447 

559 

673 

842 

1041 

990 

45-49 

Rate 

8.19 

13.72 

17,62 

22,31 

27.46 

30.75 

36.16 

41.06 

47.51 


Deaths 

288 

521 

695 

907 

1197 

1453 

1755 

2132. 

2458 

30-54 

Rate 

13.04 

21.76 

30.35 

43.32 

53.82 

64.29 

72.11 

81.43 

86.42 


Deaths 

391 

720 

1082 

1609 

2054 

2656 

3197 

3861 

4407 

55-59 

Rale 

16.14 

30.07 

47.06 

68.18 

88.37 

108.86 

121.86 

139.46 

151.86 


Deaths 

397 

812 

1391 

2201 

3050 

3939 

4788 

5961 

6743 

60-64 

Rate 

19.61 

34.75 

53.65 

87.38 

120.90 

158.32 

189.36 

218.81 

238.48 


Deaths 

381 

755 

1295 

2365 

3614 

4933 

5943 

7640 

9129 

65-69 

Rate 

20.73 

3S.22 

54.29 

87.47 

136.24 

189.11 

240.25 

289.07 

325,13 


Deaths 

301 

635 

1026 

1867 

3265 

4937 

6361 

7973 

9922 

70-74 

Rate 

20.77 

32.98 

50.36 

85.62 

128.85 

184.99 

245.18 

322.38.. 

390.94 


Deaths 

206 

370 

648 

1247 

2153 

3563 

5132 

6824 

8447 

75-79 

Rate 

19.09 

33.59 

44.96 

77.73 

115.21 

160.81 

224,70 

318.68 

415.37 


Deaths 

110 

218 

334 

680 

1186 

1906 

2983 

4591 

5976 

80-84 

Rate 

12.47 

23.49 

31.46 

64.03 

96.88 

132,16 

179.54 

260.91 

360.16 


Deaths 

36 

76 

114 

281 

495 

788 

1223 

2066 

3083 


‘The National Center for Health Statistics (NCHS) supplied the lung cancer mortality rates for the years 1950 through 1975. 
Sources for the remaining data are: U.S. Department of Commerce, Bureau of the Census; Estimates of the population of 
the United Slates by age, sex. and race, Current Population Reports, Series P-25, No. 311, 1931-1949; and U.S. Department 
of Health. Lducation and Welfare, Naiional Center for Health Statistics: Vital Statistics of the U.S., Annual Mortality 
Volumes, 1931-1949. Further details about the data can be found in; Janis JM: A Descriptive and Statistical Methodology 
for Age-Period-Cohori Analysis with Application to Lung Cancer. Unpublished doctoral dissertation. Deparrmcni of 
BLostaiistics, University of North Carolina, 1981, 
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example, the “1901 birth cohort” contains those males born during the 9-year period 1897 
through 1905; in Table 1, the diagonal containing the nine rates 1.08 x 10“^, 3.06 x 10“^ 
8.46 X iO"^,. ..,289.07 x 10“’, 390.94 x 10“’ gives the mortality rate pattern for this 
particular birth cohort. The diagonal just below this one contains the nine rates pertaining 
to the “1896 birth cohort" (i.e. those males born between 1892 and 1900). The reader 
should be aware of the fact that birth cohorts will “overlap” (i.e. will have certain years 
of birth in common) when defined in this manner; such overlap is typically ignored when 
fitting statistical models to APC data. 

Another important characteristic of this correspondence between birth cohorts and the 
diagonals of an age-by-period two-way data layout is that birth cohorts corresponding to 
diagonals at the extremes of the Table will involve very few data points (e.g. the 1851 and 
1941 birth cohorts contain only one observation each). We will say more later about the 
effects of varying birth cohort size upon the interpretation of estimated birth cohort effects. 

In a purely descriptive APC analysis, the rates (or transformations thereof) are plotted 
in various ways as a function of the age, period, and cohort groupings, one such way being 
that illustrated in Fig. 1. Here, the data in Table 1 are plotted with the rate per 100,000 
(i.e. 10®7?,y) as the ordinate and the age at death category as the abscissa (each age at death 
category being indexed fay the first year of the 5-year age interval). Successive periods of 
death are shown by solid lines, and successive birth cohorts by dashed lines. (To avoid 
confusing overlaps, some of the birth cohorts are not shown.) 

Although graphs such as Fig. 1 are helpful in obtaining general qualitative impressions 
about age, period, and cohort rate patterns, they have certain major limitations, For 
example, in Fig. 1, the pattern in the lung cancer mortality rate curve for the 1961-65 
period can be seen to increase steadily from age 30 up to about age 68, before starting 
to fall off. Note, also, that this curve cuts across a number of birth cohort curves (e.g. 1881, 
I 1886, 1891, etc.). Thus, the shape of this period curve is affected both by varying age effects 
[ and by varying cohort effects. Furthermore, a quantitative assessment of the way in vvhich 
' these age and cohort effects operate to influence the shape of this period curve cannot be 

: obtained by a simple visual examination of graphs like Fig. 1. Such quantification can only 

I be achieved via the use of valid statistical modeling procedures. 

( 

1 3. STATISTICAL AGE-PERIOD-COHORT ANALYSIS: 

GENERAL CONSIDERATIONS 

Greenberg et al. [27] first recognized the limitations of a descriptive analysis, and they 
proposed the use of a three-factor, analysis of variance-type model to quantify the 
“separate” effects of the (categorized) age, period, and cohort variables, In this paper, we 
will focus primarily on the so-called APC multiple classification model, a model which has 
been the most often used in practice, the most thoroughly studied in theory, and the most 
controversial in character. 

The APC multiple classification model has the specific structure 

= p + Kj + + (1) 

j ( = 1, 2,..., fl and y = 1, 2,.,., p; here, T,, = /{Rij) — represents some function 

■ of the observed rate R^j, p is the overall mean, a,- is the fixed effecl'of.the'ith age category, 
* fij is the fixed effect of the yth period category, and is the fixed cohort effect 

j associated with the ;th age category and the y'th period category. Note that there are 

f {a-\-p — \) distinct cohort effects defined by model (1) as i ranges from 1 to a and j from 

I 1 to p. The only random component on the right-hand side of equation (1) is £,y, which 

{ is assumed to have mean (or expected value) C(8;;) = 0. The variance and other distribu- 

t tional properties of s,, are tied to the assumptions made about the stochastic nature of 

' and hence of ^i,- (e.g. Of, is often assumed to be Poisson and Njj to be non-random). 

I For illustrative purposes, the special case u = 3, p = 4 is diagrammed in Table 2, below, 

I with a typical cell entry being £(Yfi = p -Fa, + jij + 
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FlO. 1. U. S. white male lung cancer mortality rates per 100,000 (seeTable 1) by age at death, period 

of death, and birth cohort. 


From the structure of model (I) and from an inspection of Table 2, it can be seen that 
the APC multiple classification model (I), as with more standard two-way ANOVA-type 
models, assumes that the age effects (x-s) are constant along rows, that the period effects 
(j8/s) are constant down columns, and that the cohort effects (y„_;+/s) are constant along 
the diagonals. Of course, model (1) is atypical due to the non-standard structure of the 
“cohort” term, a structure which we will interpret momentarily as a special form of 
age-by-period interaction effect. 

Model (1) is most appropriate for detecting subtle and unexpected patterns in age, 
period, and cohort effects since it does not assume any a priori specific functional 
relationships to which such patterns must conform. Such a priori structural assumptions 
(e.g. like requiring that the age effects follow some sort of polynomial curve) are generally 
hard to defend, and can actually restrict the sensitivity of model (1) for detecting important 
effects in the data. 

Without going into the mathematical details, it is important to emphasize that the cohort 
effects in model (I) characterize a very specific form of interaction between the (categorical) 
age and period variables. For a detailed mathematical discussion regarding this concept. 
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Table 1 . A diagrammatic representation of model (1) for the special case a = ‘ i , p = A 


Period group (jt) 


J=2 ;=3 


Age ‘ ^ -t- + V 3 

Group (0 i = ? 

1 = 3 p + xj + Hi + n 

4- a, -f- ^2 + V'a P + a, ^3 + yj 

+ «] + + 72 + 7j 

/I + a, + /?, + Td 
p+ci2 + ft + Vd 

P + 51) + ^4 + 'A 


the reader is encouraged to consult the paper by Kupper et al. [28]. For now, it is sufficient 
for the reader to appreciate the fact that the model 

~ p + + t^j (2) 

which contains no cohort effects, is a no-lnteraction model in the sense that the expected 
response in cell (/, j) is determined as a function of i and j solely by the marginal (or 
separate) effects of row i and column j, and not also by an effect (such as which 

is a function of both i and j (i.e. is cell-specific). 

To pursue this notion further in the context of a commonly used epidemiologic data 
analysis procedure, let us contrast models (I) and (2) with regard to the use of age 
adjustment by the direct method when assessing trends in incidence or mortality rates over 
time. In particular, let 

(s7),= t w,y,. 

i= I 

denote the directly age-standardized rate function for theyth period; the {W^} are weights 

(satisfying IF, = 1) 

I 

which vary only with / and which are usually based on the age distribution in some external 
standard population (e.g. the 1970 U.S. population). 

Under the no-interaction model (2), it follows, for j that 

so that a plot of the {(jF)^} vs y would, as desired, correctly display patterns in the period 
effects (ignoring, of course, the vagaries of sampling error). However, under model (1), for 

j 

£[(^y),- (.sF),.] = {^,- /(-.) + i 

f« I 

so that plots of the {(^F);} vs j would unfortunately reflect a complex and difficult to 
interpret mixture of varying period and cohort effects. 

Deviations from model (2) can sometimes be detected by making plots of the { F]y} vs 
} for each f (called age-specific plots). A lack of “parallelism” among these curves may 
suggest the influence of important “interaction” effects. Indeed, epidemiologists have often 
argued that the presence of such non-parallelism provides some evidence that real birth 
cohort effects are actually operating [29, 30]. As an example. Figs 2 and 3 give the age 
group-specific curves for the lung cancer data in Table 1; Fig. 2 uses itself, while 
Fig. 3 employs ln(10^.R„). There is evidence of non-parallelism in both figures, although 
the use of the fog transformation in Fig. 3 tends to dampen the effect somewhat. 

In this regard, an important point needs to be made about the choice of the function 
/ {Ri) and its impact upon the interpretation of estimated age, period, and cohort effects. 
Figures 2 and 3 help somewhat to illustrate graphically that trends in data (and hence 
estimated effects based on the modeling of such trends) are affected by the choice of 
dependent variable F,, =/(!?„) used for analysis. Numerous choices for/(.Ry) have been 
considered in the literature, some of the more popular ones being f(RiJ) = R,p 
f {Rijl = when .R,, > 0, /(Ry) = ln(l -f .R^) to permit consideration of observed rates 

equal to zero, and the logit transformation/(.R„) = ln[7?,y/(i — R,;)] for rates that have been 
suitably scaled to lie between 0 and I in value. 
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Fig. 2. Age-specittc plots of rate per 100,000; data obtained from Table 1. 


The reason that the choice for/(.^,y) has such an important role in APC analysis is that 
interaction quantification (and hence cohort effect estimation) is very much dependent on 
the scale used to measure such interaction. Kleinbaum et al. ([14], Chapter 19) provide a 
detailed discussion of the controversy in the biostatistical and epidemiological literature 
about methodological issues of interaction assessment and interpretation in health-related 
data. In particular, these authors emphasize the inadvisability of using summary rates [e.g. 
like the (sT)/s] where there is evidence of interaction. Freeman and Holdford [31] advocate 
looking for a transformation /(i^y) which will eliminate or at least suppress strong 
interaction effects in order to justify the use of “smoothed” summary indices. However, 
such a search may not be desirable in situations where the presence of certain interaction 
effects can suggest important causal mechanisms and so merit in-depth study. 

The fact that interaction quantification is so very much a model-dependent procedure 
often creates unresolvable interpretational problems regarding estimated interaction 
effects, a prime example being patterns in estimated cohort effects based on fitting models 
like equation (1) to APC data. Since cohort effects are age-by-period interaction effects 
of a very specific structure, it is clear that the choice of the function = f (.Ry) will 
significantly affect the patterns in estimated cohort effects based on modeling Ty as a 
function of the variables age, period, and cohort. In fact, it is not unreasonable to suspect 
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Year of death (first year of five years ) 

Fta. 3. Age-specific plots of In (rate per 100,000); data obtained from Table 1. 


I that, for a given APC data set, a transformation /(^y) might be found which would lead 

j to the (possibly inappropriate) conclusion that no important cohort effects are operating 

in that data set. The reader should bear in mind that the results of any statistical APC 
j analysis depend intimately on the choice of the function Ty —f (.Sy) which is to be modeled. 

I 

I 4. ESTIMATION OF MODEL (1) PARAMETERS 

i Having defined (and accepted as a reasonable approximation to the true state of nature) 

the APC multiple classification model (1), the statistical goal is then to estimate as 
accurately as possible the age, period, and cohort parameters in that rnoiiel. Once a set 
of estimates has been obtained by some procedure, it is standard practice to make separate 
plots of the estimated age, period, and cohort effects, and then to attempt to “interpret” 
etiologically any interesting patterns which may emerge. Since these age, period, and 
‘ cohort parameters appear together in the same model, it is natural to argue, for example, 
that the estimated age parameters have been “adjusted” (as in the spirit of analysis of 
1 covariance) for the effects of the period and cohort factors, with similar claims being made 

i for the period and cohort effect estimates. This statistical adjustment phenomenon, 

I whereby “pure” patterns in estimated age, period, and cohort effects can supposedly be 

: obtained, has served as the primary impetus for the popularity of regression models like 

' equation (1) when analyzing APC data. 
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As we have emphasized earlier, one major concern associated with modeling APC data 
using equation (1) is the choice for the transformation However, even if an 

appropriate specification for/(^i^) can be made, the utility of APC analysis is additionally 
seriously threatened by the so-called “identification problem”, 

4.1. The identification problem: the "'bete noire" of APC researchers 

The identification problem in APC analysis pertains to the fact that the factors age, 
period, and cohort are mathematically related. When these factors are treated as 
continuous variables, this mathematical relationship (when mortality is the endpoint) is 
simply “(year of birth) + (age at death) = (year of death).” For example, a person who is 
born in 1900 will have a lifespan of 50 years if he or she dies in 1950. The exact structure 
of the mathematical dependency among age, period, and cohort variables which have been 
categorized as in model (1) is harder to quantify; however, such a quantification is provided 
by Theorem 3.1 in Kupper et al. [28]. 

To appreciate more fully why the presence of such a linear dependency causes a problem 
when analyzing APC data (like that in Table 1) with the multiple classification model (1), 
it is helpful to work with an equivalent form of this model. In particular, let us define the 
parametric means 

1 a _ 1 p 1 a-rp-l 

a = - Z “ Z ft’ y = —r) Z yk- 

a^f, Pj=i {a+p-l 

Then, as with all fixed-effect ANOVA-type models, it is natural to reparameterize model 
(1) to its equivalent form 

y„ = //■* + «(♦+ P*+ y + «,y, (3) 

where p* = {p + S + ^ + y), ctf = (a,- a), 

ft* = (ft; - ft), and }'- y); 

clearly, then, we have 

a p a + p—I 

Z“r=Sft;= I y* = o- (4) 

i - i ; -1 * = l 

It is important to emphasize that the reparameterized model (3) simply re-expresses each 
effect in model (1) as a deviation from the mean of all effects of that type, and such 
centering creates no distortion with respect to assessing patterns in estimated effects. In 
contrast, the use [22, 32] of unnatural constraints like a, = ^, = y, = 0 does not lead to a 
straightforward equivalent representation of equation (1), and can produce misleading 
patterns in estimated coefficients. 

The restrictions (4) imply, for example, that only the first (u — 1) age effects, the first 
(p — 1) period effects, and the first (a+p—2) cohort effects in model (3) require 
estimation, since 

< = - z'^r, ft; = -z'ft; 




fr-*-p - 2 

■ Z y*- 

1 


Employing (without loss in generality) these three equalities defines the 
(ap) X [2(a -l-p) — 3] design matrix X* for the matrix representation of model (3), namely 


EfY) = X*|*. 


where the response vector is 



and the parameb: 
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Y' = (r„,..., Fi,; ..., n,;.,.; , F,,) 

and the parameter vector is 


= af, ... yf . yi^p- 2 )- 

Appendix A gives the exact form of X* for the special case a = 'i, p = 4 diagrammed in 
Table 2. 

The identification problem arises under model (5) because the matrix X* is one less than 
full rank (i.e. there exists an exact linear dependency among the columns of X*). The 
interested reader can consult Appendix A to see the exact structure of this linear 
dependency for the special case a = 3, p = 4, and also for general a and p. 

Assuming for now that the elements in are to be estimated by ordinary (unweighted) 
least squares, the normal equations to be solved for the vector of estimates are of the 
form 


X*'X*|* = X*'Y. 


( 6 ) 


Since X*''X* is one less than full rank, so that its inverse (X*'X*)“‘ does not exist, it is 
necessary to impose at least one linear constraint on the elements of |* in order to obtain 
a unique solution to the set of equations (6). (The choice of this linear constraint must 
be made carefully; in particular, statistical theory dictates that the corresponding popu¬ 
lation linear function of the elements of must be non-estimable [33].) 

A common practice in the literature has been to require that the estimates of two 
adjacent age, period, or cohort effects be numerically equal to one another (e.g. that 
«f = if), such a requirement supposedly stemming either from a priori conjectures and/or 
data-based impressions about the true underlying age, period, and cohort population 
effects. However, the choice of such a constraint on the parameter estimates generally has 
a major impact on the observed patterns in the estimated age, period, and cohort 
coefficients. As has been illustrated both with real data [17] and with artificial data [20,22], 
estimated patterns in age, period, and cohort effects typically vary dramatically as a 
function of the constraint employed, such variation often being so extreme as to prohibit 
any meaningful interpretation of the data under consideration. 

An example using the lung cancer data in Table 1 clearly illustrates this phenomenon. 
With Yij= in(iO^Ay) for these data, a seemingly very reasonable choice for a constraint 
based on etiologic considerations would be to require that the estimated age effects d* and 
d? for the first two age groups (namely, 30-34 and 35-39) be equal; i.e. we are imposing 
the linear constraint (df — d^) = 0. The underlying rationale for choosing this particular 
constraint is that it appears reasonable to assume that the young ages encompassed by 
these first two age groups would exhibit similar effects regarding the rare chronic disease 
in question; in other words, we are, in actuality, assuming that af = a* in the population. 
As we will demonstrate, if af A a*, then the vector %* obtained by solving the system of 
equations (6) under the constraint df = df will not be an unbiased estimator of the 
parameter vector 

When the constraint (df — d^) = 0 is used in conjunction with equation (5), and the 
parameters are estimated either by least squares via equation (6) or by Poisson regression*, 
the pattern in the estimated age effects is in the opposite direction to that expected (see 
Tables 3a and b). ln_particular, the estimated age effects decrease in value with increasing 
age, a pattern which is diametrically opposed to the well-documented principle that cancer 
risk increases with age. For comparison purposes, the results based on using the 
constraints 0* = and f* = y} are also presented in Tables 3a and b. The large 

fluctuations in the values of these estimates as a function of the chosen constraint are most 


^Poisson regression is a maximum liketihood-based parameter estimation procedure. The theoretical basis for 
its use is the assumption (in our notation) that t?,, has a Poisson distribution with mean with 

the true underlying rate in cell (i, J) and A; estimator of Two popular computer programs for 
fitting models like ln(x„) =+ 2 , ++ by Poisson regression methods are CATMAX [34] and 

GLIM [35]. For an excellent discussion of Poisson regression methodology, see Frome [36], 
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disturbing, especially when faced with the fact that the constraints used in Tables 3a and 
b are not any more unrealistic than other possible constraint specifications that could be 
made. 

With regard to the goodness of fit measures (i.e. R'^ and the deviance) given in Tables 
3a and b, it is typically the case when fitting by least squares the multiple classification 
model (3) to APC data that R~ values extremely close to 1 are obtained. However, this 
only means that the fitted model leads to excellent agreement between observed (and 
predicted {Y^j— fi.* -\-af + responses; such agreement promises nothing 

about the^ accuracy of the individual estimated effects constituting the conglomerate 


Table 3c. Estimated mc 


estimate Ty. 


Table 3a, Estimated model (3) effects as a function of constraint choicer method of estimation is unweighted least 

__ SQUARESt; DATA ANALYZED ARE IN TABLE 1 

___ (■;f-y?) = 0 _ 

__ Period _Cohort_Age Period Cohort Age Period Cohort 


(dr-iJI 

Age 

Peric 

-2.855 

-0.1. 

-2.S55 

-o.e* 

-1.018 

-O.Ct 

-0.277 

-0.0 

0.307 

-o.cw 

0.731 

0.0- 

1.009 

O.Oi 

1.136 

l.ll 

1.099 

0.1 

1.010 


1.713 



tThc acronym RSiE per^ 
procedure. In this exat 


/=l,2. 

means and '‘nuir effc' 
{For each of the three 


t^“ = 0 999 for each of these three sets of estimates. 

Table 3b. Estimated model (3) effects as a fuNcroN of constraint choice; method of estimation is Poisson regression 
____ (WLSIE)t; DATA ANALYZED ARE IN Ta5LE _ 

_ = o= o _ 

Age Period Cohort Aae Period Cohori 





(•;f-)'‘f) = 0 


Period 

Cohort 

Age 

Period 

Cohort 

0.466 

-4.426 

-1.-415 

-2.314 

1.828 

0.466 

-3.732 

-0.822 

-1.619 

1.828 

0,309 

-3.192 

-0.287 

-1.081 

1.673 

0.224 

-2,662 

0.139 

-0.471 

1.508 

0.073 

-2,073 

0.439 

0-073 

1.401 

-0.099 

-1.532 

0.S9S 

0.596 

1.248 

-0.298 

-0.980 

0.650 

1.092 

1.105 

-0.470 

-0.422 

0.559 

1.615 

0.968 

-0,671 

0.074 

0.350 

2.109 

0.769 


0.517 

0.085 


0.517 


0.888 

-0.296 


0.193 


fPoisson regression estimates are produced via a maximum likelihood-based procedure involving iieraiion from a set of initial 
estimates to a set of linal estimates; the acronym WLSIE pertains to the fact that “weighted least squares Initial estimates” 
were ussd to start the CATMAX [34] iteration procedure. 

+Thc standard goodness-of-fii statistic for Poisson regression is the dei'iunt'e, which is approximately equal to the “observed 
versus predicted” statistic. 

i («,, - ,'d„. 

where d„ is the predicicd count in cell (i, /) using the fitted model. For each of the three sets of estimates above, the deviance 
value was 95.682; this suggests some lack of fit when compared to x' tables with 63 df. 
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Table 3c. Estjmated model (3) effects as a function of constraint choice; method of estimation is Poisson regression 

(RSIE)t; DATA ANALYZED ARE IN TaBLE IJ 



(a»-eij) = 0 






II 

o 


Age 

Period 

Cohort 

Age 

Period 

Cohort 

Age 

Period 

Cohort 

-2.855 

-0-131 

0.270 

-2.99B 

-0.012 

-0.015 

-2.973 

-0.032 

0.031 

-2.855 

-0.099 

0.233 

-1.996 

-0.012 

-0.010 

-1.975 

-0.027 

0.03! 

-1.018 

-0.069 

0.20) 

-1.105 

-0.011 

-0.007 

-1,090 

-0.021 

0.029 

-0.2V 

-0.036 

0.172 

-0.335 

-0.008 

-0.003 

-0.325 

-0.014 

0.027 

0.307 

-0.003 

C.144 

0.278 

-0.005 

0,000 

0.283 

-0.005 

0.025 

0.731 

0.031 

0.116 

0.731 

0,001 

0.002 

0.731 

0,006 

0.023 

1.009 

0.066 

0.089 

1.038 

0.007 

0.005 

1.033 

0.017 

0.021 

1.136 

1.103 

0,062 

1.195 

0.016 

0.009 

M84 

0.031 

0.019 

1.099 

0,138 

0.036 

1,187 

0.024 

0.0)2 

1.172 

0.045 

0.018 

1.010 


O.OlO 

1.128 


0.014 

1.107 


0.014 

1.713 


-0.019 

0.877 


0.014 

0.853 


0.009 



-0.051 



0.011 



0.000 



-0.084 



0.007 



-0.008 



-0.116 



0.004 



-0.017 



-0.148 



0.000 



-0.025 



-0.177 



0.000 



-0.031 



-0.210 



-0.005 



-0.041 



-0.238 



-0.012 



-0,053 



-0.290 



-0.026 



-0.072 


tThe acronym RSIE pertains to the fact that “researcher-selected initial estimates” were used to start the CATMAX [34] iteration 
procedure. In this example, the initial estimates were as follows: 



Pj^\ 

i= 1,2 . a', pj* 0,y “ 1,2,y** = 0, k =* 1,2.+ p — 1. Comparable initial estimates (e.£. age-specific marginal 

means and "null” effects for period and cohort) have been used by other researchers [3, 12]. 
iFor each of the three sets of eslimates above, the deviance value w'as larger than 10^ 


\ As mentioned in a footnote to Table 3b, the deviance reflects discrepancies between 

observed (£?y) and predicted (t?y) cell counts, and hence can be large even when R‘ is 
essentially equal to 1 in value (as can be seen in Tables 3a and b). The deviance has been 
criticized by some researchers [3] as being too sensitive to departures from the fitted model 
when large populations at risk are under study. These researchers have suggested the use 
of an alternative measure of fit for such count data; this ad hoc statistic is similar in 
j structure to and generally produces comparable values. 

! One disturbing, but not typically appreciated, characteristic of the deviance measure 

i.ly=i 

is that it is not invariant with respect to scale changes in Oy (and hence in (3y). As an 
example, the Oy values in Table 1 are averages per year, so that the person-year values 
(calculated as fVy = Oy/^y) are also on a per-year basis. If we had worked instead with 
5-year totals of deaths and person-years, the deviance value would have been approxi- 
’ mately five times larger than that given in Table 3b, In contrast, a scaling up of each Oy 
value and each Ay value by a factor of 5 leaves Ay, and hence R^, unaffected. 

Finally, a comparison of the results in Tables 3b and c highlights another alarming 
phenomenon. In particular, it is an unfortunate circumstance that the final parameter 
estimates obtained via an iterative maximum likelihood computer program (e.g. as used 
to generate the Poisson regression analysis results given in Tables 3b and c) depend on 
the values of the initial estimates used to start the iteration process. The underlying reason 
for the discrepancies among sets of parameter estimates based on different starting values 
for the iterations is that the iteration process often does not search the entire likelihood 
surface for the desired “global” maximum; instead, the iteration stops at a “local” 
maximum close to the set of initial estimates (which is apparently what happened in Table 
3c). Although this undesirable phenomenon has been mentioned in the literature [3], the 
strong possibility of obtaining incorrect parameter estimates based on the use of such 



« 

t 
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likelihood surface search routines does not seem to have detracted from the popularity of 
such iteration algorithms. 

Returning to our discussion of the results in Tables 3a and b, it can be shown that 
estimation bias is the primary reason why patterns in estimated age, period, and cohort 
effects vary so much as a function of the choice of the additional linear constraint which 
must be placed on the elements of in order to fit model (5). A Searle [33] points out, 
the elements of will be unbiased estimators of the corresponding elements of %* only 
if the chosen constraint actually holds among the true (but unknown) population 
parameter values. When the chosen constraint does not hold perfectly in the population 
(a situation which almost always exists), the extent of the estimation bias attendant with 
the use of any linear constraint, like (af—aj) = 0, will depend on how much the 
corresponding population linear function, like (af — a^"), differs from zero. 

Theorem 3.2 in Kupper et al. [28] characterizes the exact structure of the bias in as 
a function of the “amount” by which the chosen constraint is in conflict with the true 
(unknown) population structure. Appendix B discusses this theorem and provides an 
example illustrating its use. The important general implication of Theorem 3.2 [28] is that 
the choice of constraint is the crucial determinant of the accuracy in the estimated age, 
period, and cohort effects based on model (5). 

As mentioned earlier, AFC researchers have employed two general strategies when 
searching for an appropriate constraint. The first method involves examining several sets 
of estimated coefficients, each set obtained via the use of a different constraint supposedly 
chosen based on a priori suppositions about the population under study. The typical 
impression gained from such an examination is that there is considerable variation in 
estimated coefficient values from set to set, and that a definitive choice as to which set of 
coefficients (if any) is most reliable is often difficult, if not impossible, to make. 

Another popular approach for choosing a constraint involves a preliminary descriptive 
examination of patterns in the data to be analyzed. This data-based method utilizes certain 
observed trends (e.g. in the crude age-specific means) to suggest a possibly realistic 
constraint or to start a maximum likelihood iteration process for parameter estimation 
[3,12]. Kupper et al. [28] discuss in theoretical terms why such data-dependent procedures 
can be quite misleading, and their position has been supported by Holford [7] and Rodgers 
[22,23]. 

In theory, the following scenario describes about the only situation in which an APC 
analysis using model (5) might be informative. Suppose that several independent linear 
constraints on the elements of can be specified, where each such constraint has strong 
justification on purely theoretical grounds (and not on hints obtained from a qualitative 
inspection of the observed data). If the separate sets of estimates obtained by applying each 
of these various theoretically-based constraints to model (5) are in quite close agreement 
with one another, then one might have some confidence in the reliability of this common 
set of estimated age, period, and cohort effects. 

In reality, however, the above desirable situation very rarely occurs, and the APC data 
analyst is usually faced with choosing one particular set of estimated coefficients from 
several sets in which the coefficient estimates vary, often dramatically, from set to set. The 
final choice of one particular set of estimated coefficients (or, equivalently, of one 
particular constraint) is often a very subjective one, with preference given to estimated 
coefficient patterns which seem to support, at least approximately, certain a priori 
suspicions about relationships among the particular age, period, and cohort factors under 
Study. Such subjective data-analysis decisions leave room for considerable controversy 
[22,23,25]. 

The estimated coefficients ultimately reported are often accompanied by their estimated 
“standard errors” [3, 37, 38]; such standard errors are available automatically as part of 
the computer output from popular model fitting programs (e.g. unweighted and weighted 
least squares, and Poisson regression packages). We hasten to discourage the use of such 
standard errors to construct confidence intervals for the age, period, and cohort effect 
parameters of interest under model (5). Such standard errors completely ignore the 
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potentially large bias in the estimated coefficients. Since bias, and not variance, is the 
overwhelming source of discrepancy between estimated and true coefficient values, the use 
of these variance-based standard errors will create a misleading impression about the 
“closeness” of these estimates to Che true population values. An area of needed statistical 
research would involve the development of realistic error bands for the estimation of APC 
coefficients using model (5); such error bands would have to take into account the bias 
in estimated effects due to inappropriate constraint specification, A detailed simulation 
study would almost certainly be needed, but such a difficult research effort has not yet been 
pursued in any productive way. 

4.2. Two-factor models 

The discussion thus far has focussed on statistical issues concerning accurate estimation 
of the parameters in mode! (5), a model involving all three of the factors age, period, and 
cohort. One possible way to avoid the identifiability problem attendant with the use of 
model (5) is to argue that one of the three factors (e.g. period) is “unimportant”, in which 
case a two-factor (e.g. an age-cohort) model could reasonably provide a valid description 
of the data. 

The use of such two-factor models to describe APC data has become quite popular in 
recent years [3,10,11, 17, 38, 39,41,42]. However, the statistical methodologies employed 
to demonstrate the “unimportance” of one of the three factors can be seriously misleading 
(as we will presently illustrate by example), and it is our position that published analyses 
based on the use of two-factor models (most typically of the age-cohort variety) are 
potentially in error. 

In general, the strategy typically employed for choosing a two-factor model over model 
(5) is as follows. First of all, each of the three two-factor models is fit to the data (e.g. 
by least squares or Poisson regression methods). Next, measures of “goodness of fit” of 
these models (e.g. R- when using least squares, or the deviance [36] when using Poisson 
regression) are computed and then compared to the corresponding measure obtained by 
fitting model (5) to the data.* That two-factor model which best fits the data, and whose 
fit is not significantly different from that of the three-factor model, is then reported as the 
most appropriate mode!, (Of course, if none of the two-factor models provides a good fit 
to the data, then the three-factor model analysis, with its accompanying identifiability 
problem, would have to be pursued.) 

Kupper ei al. [28] have demonstrated both theoretically and by example that the above 
two-factor model selection strategy can be seriously misleading if one of the three sets of 
effects (age, period, or cohort) conforms to a true underlying linear pattern. In that 
situation, measures of fit like R - and the deviance for the three-factor model (5) will turn 
out to be identically equal in value to the and/or deviance values for that particular 
two-factor model not involving that specific set of effects satisfying a linear relationship. 
Without giving a formal mathematical argument, the underlying reason for this disturbing 
phenomenon is that the R~ and deviance values based on fitting model (5) are the same 
regardless of the orientation of the linear pattern (since such measures of fit are invariant 
with respect to the choice of constraint, the determinant of that orientation), and one such 
orientation is the horizontal one indicating no non-zero effects for that factor. 

The following numerical example provides a vivid illustration of the troublesome 
situation described above. The hypothetical data to be considered appear in Table 4a and 
b, and the results of various revealing Poisson regression analyses of these data are 
presented in Table 5. (Poisson regression-based parameter estimates were identical to those 
obtained by unweighted least squares, and so the latter results will not be reported.) 


*It is important to note that measures of fit like R ■ and the deviance based on fitting model (5) are invariant 
with respect to the choice of constraint [33], so that the values of test statistics for assessing the significance 
of changes in R- and/or deviances in going from two-factor to three-factor models are the same regardless 
of the specific form of constraint used to fit (S). However, statistics [37, 40] which depend on the estimated 
values of individual coefficients under model (5). and hence vary with the choice of constraint, should not 
be used. 
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Table 4a. Table of hypothetical rates per 100,000 (i.e. 10^ lton 

WHICH THE RESULTS IN TABLE 5 ARE BASED 





Period group {J) 


/ = l 

J = 2 

2=3 

7=4 

7=5 


i = 1 

1.73 

1.S2 

1.82 

1.73 

1.65 

. 

i=2 

1.97 

2.12 

2.23 

2.23 

2.21 


i^3 

2.J0 

2.39 

2.56 

2.69 

2.69 


/=4 

2.17 

2.52 

2.87 

3.08 

3.24 


( = 5 

2.08 

2.53 

2.94 

3.35 

3.60 


1 = 6 

1.79 

2.29 

2.80 

3.25 

3.71 


i = 7 

1.51 

1.94 

2.50 

3.05 

3.54 

Table 4b. 

Table of 

HVPOTHETICAL VALUES OF O.. 

UPON 

THE 


RESULTS IN Table 5 are BASEof 






Period group ij) 




j = 1 

/ = 2 

y = 3 

7=4 

7=5 


/ = 1 

8650 

9100 

9100 

8650 

8250 


1 = 2 

9850 

10,600 

11,150 

11,150 

10,600 

Ag« 

i = 3 

10.500 

11,950 

25,600 

26,900 

26,900 

group 

1-4 

10,850 

25,200 

28,700 

30,800 

32,400 

U) 

i = 5 

!l),400 

25,300 

29.400 

33,500 

36.000 


i=6 

8950 

11,450 

28,000 

32,500 

37»100 


1 = 7 

7550 

9700 

12,500 

30,500 

35,400 


tThe hypothetical values of used in the Poisson regression analyses 
of Tabic 5 are obtained from the appropriate entries in Tables 4a and 
b via the formula Nff~ 0^fiR^f. 

The rates in Table 4a were generated via model (3) as 
ln(10^4.) = ^* + o:t + p*+ 

where fi* = (a + ^ + y)-, the values of these parameters are listed in the first column of 
Table 5 (for simplicity, we have assumed that fi —0 and that 8,j = 0). As an example, when 
i =j ^ 1, then 

ln(10’R,.,) = (0.220+ 0.300+ 0.280) -0.220 -0.200 +0.170 = 0.550 

so that 

10% = = 1.73 

The entries in Table 5 illustrate numerically the following principles: (i) when fitting 
mode! (3), the use of a constraint (namely, df = d*) which actually holds in the population 
(namely, otf = of^) yields unbiased estimators of the age, period, and cohort population 
effects (cf. columns {1} and {2}); (ii) the bias in the parameter estimators using model (3) 
varies considerably with the choice of constraint (cf. columns {2}, (3} and {4j); (iii) the 
deviance and statistics based on fitting model (3) are invariant with respect to the choice 
of constraint employed (cf. deviance and J?" values for columns {2}, {3}, and {4}); and. 
(iv) when one set of population effects follows a straight line pattern (in this example, it 
is the set of period effects), then equating any two such estimated effects under model (3) 
“rotates” the (estimated) linear pattern to a horizontal position, creating the false 
impression that such a factor is “unimportant” (see column {3}). 

With regard to principle (iv), an examination of the three two-factor model sets of 
estimates and their deviance and 7?* values (columns -{5), {6}^ and {7}) leads to the 
conclusion that the fitted age-cohort (A-C) model would be chosen as the “best” model 
(along with its severely biased set of effect estimates). What this means in practice is that 
a non-significant improvement in fit over a two-factor model when using model (3) does 
not allow one to distinguish between the situation where there are really no non-zero 
population effects for the added third factor (i.e. the horizontal linear orientation) and the 
situation where the third factor population effects follow a aon-horizontal linear re¬ 
lationship (thus suggesting an important role for that factor). If the latter situation actually 
exists, then the chosen two-factor model v.'ill produce a very misleading impression of the 
true patterns in the corresponding age, period, and cohort population effects (again, see 


Table 5. As iliustrat 

PERIOD. AND COHORT Eh 


True APC 
parameter 
values fCol. {1}) 

Age effecis {a = C.220> 

ay* -0.220 
aj = -0.120 
«?= -0.030 
a? *0.055 
a? = 0.]10 
af = 0.110 
a? = 0.095 

Period effects 0 — 0.3 
- 0.200 
= -O.lOO 
/?? = 0 
= 0.100 
j9 ^ = 0.200 

Cohort effects (v = 0.^ 
-0.280 
yj = -0.130 
Vj = 0.020 
7 ? = 0.120 
Vj = 0-170 

vj = 0.200 
V,*= 0.170 
y? = 0.J20 
>-,* = 0.020 
-0.130 

yfj - -0.280 
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Table 5. An illustration of the fluctuation in numerical estimates, obtained via Poisson regression [34], of the age. 

PERIOD. AND COHORT EFFECTS IN MODEL (3) AND IN CERTAIN TWO-FACTOR MODELS AS A FUNCTION OF THE CHOICE OF CONSTRAINT; 

_= 0 AND = 0 FOR simplicity; data Analyzed are in Tables 4a and b_ 


True APC 
parameter 
values (Ctsl. {!}) 



Parameter 

estimates 


1 



Full APC model (3) 



Two-factor models 


Col. (2) 

Col. |3i 

0 

II 

A-P 
Col. (5} 

A-C 

Col. {6} 

p-c ! 

Col. {7} 1 

Age cfTects (a = 0.220) 








af = -0.220 

-0.220 

-0.520 

-0.310 

-0.340 

-0.520 



-0.120 

-0,120 

-0.320 

-0.180 

-0.142 

-0.320 



sf = -0.030 

-0.030 

-*0.]30 

-0.060 

-0.007 

-0.130 


,T 

scj = 0,055 

0.055 

0.055 

0.055 

0.119 

0.055 


... - 

«j = o.no 

0.110 

0.210 

0.140 

0.170 

0.210 



*,* = 0.110 

0.110 

0.310 

0.170 

0.140 

0.310 



aj = 0.005 

0.095 

0.395 

0.185 

0.060 

0.395 


1 ■ 

Period effects = 0.300) 








-0.200 

-0.200 

0 

-0.140 

-0.255 


-0.297 

1- 

P,*= -0.100 

-0.100 

0 

-0.070 

-0.100 


-0.144 

■ 

fir^o 

0 

0 

0 

0.030 


-0.001 


= 0.100 

0.100 

0 

0.070 

0.128 


0.144 


= 0.200 

0.200 

0 

0.140 

0.197 


0.298 


Cohort elTecls (f = 0.280) 








yr= -0.280 

-0.280 

-0.780 

-0.430 


-O.780 

-0.082 


y}= -0.130 

-0.130 

-0.530 

-0.250 


-0.530 

0.050 


Vf = 0,020 

0.020 

-0.280 

-0.070 


-0.280 

0.178 


= 0.120 

0.120 

-0.080 

0.060 


-0.080 

0.235 

.- 

71^ = 0.170 

0.170 

0.070 

0.140 


0.070 

0.241 

« _ 

Vf = 0.200 

0.200 

0.200 

0.200 


0.200 

0.243 

•.= 

yf = 0.170 

0.170 

0.270 

0.200 


0.270 

0.162 

- = 

y,* = 0,120 

0.120 

0.320 

0.180 


0.320 

0.049 


= 0.020 

0.020 

0.320 

0.110 


0.320 

-0.126 


yr,= -0.130 

-0.130 

0.270 

-0.010 


0.270 

-0.361 


yfi = -0.2S0 

-0.280 

0.220 

-0.130 


0.220 

-0.589 


Deviance: 

0.483 

0.483 

0.483 

5329 

0.483 

104! 


df: 

15 

15 

15 

24 

18 

20 

__ 

R- values; 

I 

1 

1 

0.985 

1 

0.998 



columns {1}, {3}, and {6} in Table 5). These disturbing results suggest that APC analysis 
procedures which hinge on the use of two-factor models [10, 11] have the potential to 
I produce misleading conclusions. 

I 5. DISCUSSION 

I 

; The presentation in the previous sections focussed on statistical APC analysis using the 

P popular multiple classification model (3). The identifiability problem attendant with the 
: use of model (3) was discussed theoretically, and its deleterious impact upon the 

I interpretation of APC data analysis efforts was illustrated numerically both with real and 

{ with hypothetical data. 

I The strategy of circumventing this identifiability problem via the use of a “good-fitting” 

I two-factor model was shown lo be potentially misleading when one of the three factors 

(age, period, or cohort) has an underlying population effect pattern which is essentially 
; linear. In particular, the fact that a two-factor or three-factor APC model fits the data well 

I (i.e. there is good agreement between observed and predicted responses as measured by R~ 

■ or the deviance) does not allow one to conclude that individual coefficients in that model 

I have been estimated with validity. Indeed, an inspection of Table 5 reinforces the 

' disturbing fact that good fitting APC models can involve estimated effects which are 

I seriously biased. 

j Based on the above considerations, it is our opinion that valid statistical APC analyses 

are exceedingly difficult, if not impossible, to carry out when the underlying model to be 
fit is of the multiple classification form (3). What, then, are some viable approaches, if any, 
for producing a valid statistical modeling analysis of APC data? Certainly, the most direct 

( way to avoid the identifiability problem is to consider using a model form which is different 
in basic structure from that of model (3). Various approaches involving alternative model 
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forms have been considered in the literature [2, 4, 6,7,11, 12,41], some of which may be 
“different enough” from model (3) to avoid the identifiability problem. However, none of 
these approaches or models has received anywhere near the statistical scrutiny given to the 
multiple classification model (3). Until comparable evaluations are made about these 
alternative methodologies, judgment must be withheld regarding their utility for producing 
valid analyses of APC data. 

The modeling controversy aside, an even more basic problem with APC analyses 
concerns the characteristics of the data used in such analyses. Firstly, it is well known that 
the accuracy of time-related patterns in mortality and morbidity rates can be severely 
compromised by a number of possible sources of error (e.g. diagnostic errors, changes in i 
diagnostic procedures over time, errors in determining person-years at risk, etc.). In I 

modeling trends in such rates over time as a function of the age, period, and cohort factors, .' 

it is necessary to realize that such modeling exercises (indeed, any statistical analyses) are 
futile if the data base itself is unreliable. 

As mentioned quite early in this paper, one unique characteristic of APC data is that 
the number of rates associated with each of the various cohorts varies from 1 to min(a, 
p) as one moves from the extreme diagonals to the middle diagonalfs) of an age-by-period 
two-way layout (again, refer to Table 1). A re-casting of the data in Table 1 into an 
age-by-cohort two-way display (see Table 6) dramatically illustrates this variation in 
cohort-specific data. Loosely speaking, each empty cell in Table 6 can be looked upon as 
a “missing observation” in the sense that no cohort is observed over the entire range of 
age categories (nor over the entire range of period categories). Indeed, two birth cohorts 
in Table 6 (namely, the 1851 and 1941 cohorts) are associated with only one age group 
each, two cohorts (namely 1856 and 1936) involve only two age groups each, etc. Overall, | 
110 of the 209 cells in Table 6 (i.e. 52% of the cells) are empty. 1 

Researchers have recognized this problem and have attempted to adjust for it in various 
ways (e.g. by weighting each cohort coefficient inversely with the number of observations 
specific to that cohort [27], or by discarding estimates of cohort effects involving small 
numbers of observations [3]). Such adjustments are basically ad hoc in nature, and do not 
justify extrapolation of estimated trends to future cohorts and future periods. As with any 
statistical modeling analysis, only prediction within the range of age, period, and cohort 
categories actually observed is permissible; and, even then, such interpolation is question- | 
able when it involves those cohorts containing few observations. 

In summary, the statistical analysis of APC data is plagued by many unresolved issues 
and potential sources of error. First of all, the selection of the response function Y^=f (/?,;) 
to be modeled can have a significant influence on patterns in estimated age, period, and ; 
(especially) cohort effects. Moreover, even if specification of a meaningful function / {R^|) 


Table 6. Luno cancer mortality data of Table 1 arranOep by central year of birth and by age group at death 


Central 





Age group at death 
















80-84 

birth 

30-34 

35-39 

40-44 

45-49 

50-54 

55-59 

60-64 

65-69 

70-74 

75-79 

1851 











12.47 

1856 










19.09 

23,49 

1361 









20.77 

33.59 

31.46 

1866 








20,73 

32.98 

44.96 

64.03 

1871 







19.6! 

38.22 

50.36 

77.73 

96.88 

1876 





. - 

16.14-' 

34.75 

54.29 

85.62 

115.21 

132.16 

1881 





13.04 

30.07 

53.65 

87.47 

128.85 

160.81 

179.54 

1886 




8.19 

21.76 

47.06 

87,38 

136.24 

184.99 

224.70 

260.91 

18P1 



4.51 

13.72 

30.35 

68.18 

120.90 

L89.li 

245.18 

318.68 

360.16 

1896 


2.3S 

6.78 

17.62 

43.32 

88.37 

158.32 

240.25 

322.38 

415,37 


1901 

1.08 

3.06 

8.46 

22.31 

53.82 

108.86 

189.36 

289.07 

390.94 



1906 

L.43 

3.66 

10.05 

27,46 

64.29 

121.86 

218.81 

325.13 




1911 

1.64 

4.10 

11.75 

30.75 

72.11 

139.46 

238.48 





1916 

1.58 

4.32 

13.40 

36.16 

81.43 

151.86 






1921 

i.52 

5.26 

16.07 

41,06 

86.42 







1926 

1.96 

6,35 

19.60 

47.51 








1931 

2.16 

7.25 

19.87 









1936 

2.14 

7.21 










1941 

1.74 








- 
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can be made, the choice of constraint on the elements of in model (5) required to fit 
the multiple classification model (3) has an even more dramatic impact on such estimated 
effect patterns. Unfortunately, such a constraint choice must be based on a priori 
asstimptions whose validity cannot be demonstrated empirically with the data under 
analysis. The extent of the bias in estimated effects resulting from the use of a constraint 
which almost certainly does not hold exactly in the population under study cannot be 
accurately measured; also, such bias invalidates the use of variance-based standard errors 
for assessing the “accuracy” of such estimated eifects. 

The adoption of a two-factor model based on standard goodness-of-flt criteria is invalid 
when the population effects for one of the factors (age, period, or cohort) follow a 
non-horizontal linear pattern. Hence, to attempt to avoid the identifiability problem by 
documenting (via statistical testing procedures based on changes in deviance and 
values) the existence of a “good-fitting” two-factor model can be a seriously misleading 
strategy in the sense that an important factor may be erroneously discounted. Finally, even 
if a fitted APC model can be found which “appears” reasonable, the use of such a model 
for prediction (e.g. about present or future cohort effect patterns) is severely limited by 
the lack of cohort-specific data. 

Given these potential sources of error attendant with currently available APC statistical 
modeling procedures, it is our position that such regression methods cannot be said to 
provide important interpretational advantages over traditional graphical approaches. It is 
possible that such modeling procedures may constitute part of a general “accounting 
framework” [25,43], but the potential for researcher bias is great because of the need for 
a priori assumptions concerning underlying population relationships among the variables 
under study. 

With regard to future research efforts, we recommend that further work be done to 
develop and evaluate (in a rigorous statistical sense) innovative procedures which by-pass 
the identifiability problem, but which still provide relevant information regarding age, 
period, and cohort effect patterns, Since such development and evaluation has not yet 
taken place, we are forced to conclude that the current state-of-the-art of statistical APC 
analysis is still in its infancy. 
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It thus follows that the (12 x 11) matrix X* has the structure 

' 1 1 0 1 0 0 0 0 

1 1 0 0 1 0 0 0 

1 1 0 0 0 1 0 0 

1 1 0 -1 -1 -1 ~1 -1 

1 0 1 1 0 0 0 1 

10 10 10 0 0 

10 10 0 10 0 

1 0 1 -1 -1 -1 0 0 

1-1-1 1 00 1 0 

I -1 -1 0 1 0 0 1 

1 -1 -1 0 0 1 0 0 

I -1 -1 -1 -1-1 0 0 


1 

0 

0 

-1 

0 

1 

0 

0 

0 

0 

1 

0 


0 

1 

0 

-1 

0 

0 

1 

0 

0 

0 

0 

1 


0 

0 

1 

-1 

0 

0 

0 

1 

0 

0 

0 
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Although it is certainly not obvious based on a brief inspection, this matrix X+ is actually one less than full 
rank; in other words, there is an exact linear dependency among the columns of X*. To see this, let us equivalently 
represent X* by its columns as 


X* = (l; Af, AJ; Bf, Bf, BJ; Cf, Cf, Cf, CJ, C^), 


so that, for example, 1 is the (12 x 1) column vector of ones, Af' = (l, 1, 1. 1, 0, 0, 0, 0, —1, —1, —1, — 1), 
etc. Then, the interested reader can verify that the following linear relationship holds among the columns of X*: 

-Af-l-SBf 4-lBJ-iBr-fcr-|Cr-iC3*+iCJ + 3cf = 0. (A.!) 

To check that equation (A.. 1) holds requires a row-by-row inspection of X*. 

For general a and p, the column representation of X* under model (5) is 

X* = (1; M . K- n Bf .B;. cr .(A.2) 

Kupper et al. ([28], Theorem 3.1) have shown that the general expression for the linear constraint holding among 
the columns of the (ap) x [2(a + ;?) - 3] matrix (A.2) is as follows; 


(fl + !)■ 


n r - > r 

A*- I j- 

J'-l _ 


[p + !)■ 


a + p — 1 

18*+ i 


k - 


(a +P) 


Cf =0. 


(A.3) 


It is easy to show that equation {A.3) reduces to equation (A.l) when a = 3 and p = 4. 

The form of equation (A.3) is revealing. Since the orthogonal polynomial coefficients for assessing the linear 
effect of an equally-spaced variable with I levels are given by the values of 




for ( = 1,2./, equation lA,3) says, loosely speaking, that “(the linear component of the age columns) — (the 

linear component of the period columns) + (the linear component of the cohort columns) equals zero.'* This result 
reflects the fact that it is the imear effects of the age, period, and cohort categorical variables (in contrast to their 
higher-order effects) which are inextricably mixed, and hence are not individually identifiable. 


APPENDIX B 

Equation (A.3) in Appendix A can be compactly written in matrix notation as 


) 


i 


I 

I 

f 


X*v = 0 


where X* is given by (A.2) and where 
(a + l) , (cH-l) 




2 - 


. 


ip - 1 - 1 ) 


2 ’ 2 ''2 

(a+p) (c-l-p) 




. ia+p-2)- 


(a +p)1 


(B.l) 


Now, let 

Ir=(p*; sr. ■ -. - /?r, • ■ ■, n, 

denote a set of estimates of the elements of 


^‘' =(,.•; ..., otj. Pf,.. K,... 


based on using the constraint c'^* = 0, where c' is an appropriately chosen I x [2(c + p) — 3] row vector of 
constants dehriing the constraint of interest. For example, when 


c' = (0, I, -i, 0, 0, ...,0), 

then c't* = (af — if) = 0, so that 4* is that vector of estimates obtained by forcing the estimates of the first two 
age effects to be equal (i.e. by requiring that if = if). Recall that this particular constraint was employed to 
obtain the sets of estimates given in Tables 3a, b and c using the lung cancer mortality data in Table 1. 


I 
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Kupper et al, ([28], Theorem 3.2) have shown that the bias in when used to estimate under model (5) 
has the specific structure 

Bias(i») = £(i'»)-^* = A:v (B,2) 

where v is defined in equation (B.l) and where 

k — — c'E*/c'v; (B.3') 

note that c't 0 since c'4* is required to be non-estimable. 

il is clear from equations (B.2) and (B.3) that the elements of 4* wiil all be unbiased estimators of the 
corresponding elements in under model (5) when A = 0, or equivalently, when c'^* = 0, In other words, 
= %* when the constraint c'|* = 0 employed to obtain the vector of estimates 4? actually holds in the 
population (i.e. c'4* = 0). [Under model (5), 

s(“?)= - z' £(«,*). 

/-■ I 

with similar expressions holding for £(/?*) and £(f^ 4 .p_ J. In general, v, c, and 4* change in definition depending 
on which particular age, period, and cohort parameters are removed using the set of restrictions (4), but the values 
of the biases based on equations {B,2) and (B.3) do not.] 

As an example, consider again the constraint 

where 


c' = (0, 1. -1, 0, 0....,0). 


Using T as defined in equation (B.l), we then have 






c'v — 

‘ 2 


2- 

2 


and 


so that A = (a f — af) and 


For example. 


e'4* = (af — a*). 


Biasdf) = (a * — «*)v. 


Biasdff) = E{jfr) - = (af - af) 


\P + 1) 
2 



Thus, unless an element of v is zero (in which case the parameter corresponding to that element will be 
unbiasedly estimated under the assumed model (5) regardless of the constraint c'l)' = 0 which is chosen), the value 
of the bias will be a function both of the sign and magnitude of the true (and unknown) parametric difference 
(af — aj). In particular, the bias in a particular effect estimator could be positive or negative depending on the 
(unknown) sign of (af — af), and the magnitude of the bias will depend on the (unknown) value of |af — af |. 
It is no surprise, then, that different choices for a constraint can lead to widely different sets of estimated effect 
values (again, refer to the numerical results in Tables 3a, b and c). 

Finally, expressions (B.2) and (B.3) lead to a very simple and informative condition for determining which 
linear functions of 4* are estimable (i.e. are estimated with no bias). In particular, if is some linear function 
of 4* of interest [e.g. 1'4* = a f when 1' = (0, 1, 0, 0,, 0)], then it follows directly from equations (B.2) and 
(B.3) that 

Biasd'lf) = £{l'|f) -1'4» = IX(4f) - 1 %* = r(kv) = kl'v, 

so that 


Biastr^f) = 0 when I'v = 0. 


The condition Tv — 0 provides a very simple check for estimability, as opposed to the more complex partitioned 
matrix-based conditions given by Holford [7], Asa simple example, af is not estimable since, from equation (B.l), 


rv = {0, I, 0, 0.0)v = 


(a 4- !)• 


7 ^ 0 , 


u > 1. 


More generally, since v in equation (B.l) involves the orthogonal polynomial coefficients for the linear effects 
of the age, period, and cohort factors, il Follows that I'v — 0 by definition when I consists of the orthogonal 
polynomial coefficients for quadratic and higher-order effects. 

The above simple proof that it is the non-linear effects of the age, period, and cohort factors which are 
estimable has been demonstrated via more complex arguments [17], Holford [7] has made use of this result to 
estimate these non-linear effects using orthogonal polynomials. However, to relate his non-linear effect estimates 
to the parameters of interest in model (5) still necessitates making, as Holford acknowledges, a generally 
unverifiable assumption about the values of one or more parameters (i.e. the identifiability problem has not been 
avoided). As we have demonstrated, when such an assumption does not hold, there is strong potential for 
obtaining seriously biased estimates of the elements of 4* ih model (5), 
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